Improving Air Pollution Prediction Modelling Using Wrapper Feature Selection
نویسندگان
چکیده
Feature selection is considered as one of the essential steps in data pre-processing. However, all previous studies on predicting PM10 concentration Malaysia have been limited to statistical method feature selection, and none these used machine-learning approaches. Therefore, objective this research investigate influence variables prediction model by using wrapper compare performance different predict for next day. This uses 10 years daily pollutant concentrations from two stations (Klang Shah Alam) obtained Department Environment (DOE) 2009 until 2018. Six methods (forward backward elimination, stepwise, brute-force, weight-guided genetic algorithm evolution predictive analytics multiple linear regression (MLR) artificial neural network (ANN)) were implemented study. study found that brute-force dominant most best models selecting important features MLR. Moreover, compared MLR, ANN provides more advantages regarding accuracy permits PM10. The overall results revealed RMSE value day Klang 20.728, while AE 15.69. Furthermore, Alam 10.004, 7.982. Finally, predicted can be concentrations. proposed a tool an early warning system giving air quality information local authorities order formulate air-quality-improvement strategies.
منابع مشابه
Wrapper Feature Selection
INTRODUCTION It is well known that the performance of most data mining algorithms can be deteriorated by features that do not add any value to learning tasks. Feature selection can be used to limit the effects of such features by seeking only the relevant subset from the original features (de Souza et al., 2006). This subset of the relevant features is discovered by removing those that are cons...
متن کاملWrapper for Ranking Feature Selection
We propose a new feature selection criterion not based on calculated measures between attributes, or complex and costly distance calculations. Applying a wrapper to the output of a new attribute ranking method, we obtain a minimum subset with the same error rate as the original data. The experiments were compared to two other algorithms with the same results, but with a very short computation t...
متن کاملImproving performance for classification with incomplete data using wrapper-based feature selection
Missing values are an unavoidable problem of many real-world datasets. Inadequate treatment of missing values may result in large errors on classification; thus, dealing well with missing values is essential for classification. Feature selection has been well known for improving classification, but it has been seldom used for improving classification with incomplete datasets. Moreover, some cla...
متن کاملFeature Selection For The Prediction Of Tropospheric Ozone Concentration Using A Wrapper Method
High concentrations of ozone (O3) in the lower troposphere increase global warming, and thus affect climatic conditions and human health. Especially in metropolitan cities like Istanbul, ozone level approximates to security levels that may threaten human health. Therefore, there are many research efforts on building accurate ozone prediction models to develop public warning strategies. The goal...
متن کاملWrapper-Filter Feature Selection Algorithm Using a Memetic Framework
This correspondence presents a novel hybrid wrapper and filter feature selection algorithm for a classification problem using a memetic framework. It incorporates a filter ranking method in the traditional genetic algorithm to improve classification performance and accelerate the search in identifying the core feature subsets. Particularly, the method adds or deletes a feature from a candidate ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Sustainability
سال: 2022
ISSN: ['2071-1050']
DOI: https://doi.org/10.3390/su141811403